Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 419403 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 32.0 MiB |
| Average record size in memory | 80.0 B |
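The memory figures above are consistent with one another if each of the 10 numeric cells in a record occupies 8 bytes. That storage width is an assumption inferred from the 80.0 B record size; the report does not state it. A minimal sketch of the arithmetic:

```python
# Figures taken from the dataset statistics table.
n_variables = 10
n_observations = 419_403
bytes_per_value = 8  # assumption: 64-bit numeric storage per cell

record_size = n_variables * bytes_per_value          # bytes per record
total_mib = record_size * n_observations / 2**20     # 1 MiB = 1,048,576 bytes
column_mib = bytes_per_value * n_observations / 2**20

print(record_size)            # 80
print(round(total_mib, 1))    # 32.0
print(round(column_mib, 1))   # 3.2
```

The per-column value matches the "Memory size" of 3.2 MiB reported for each variable.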
Variable types
| Numeric | 10 |
|---|---|

| Alert | Type |
|---|---|
| Date is highly correlated with Biomass_MW | High correlation |
| Consumption_MW is highly correlated with Coal_MW and 2 other fields | High correlation |
| Coal_MW is highly correlated with Consumption_MW | High correlation |
| Gas_MW is highly correlated with Consumption_MW and 1 other field | High correlation |
| Biomass_MW is highly correlated with Date | High correlation |
| Production_MW is highly correlated with Consumption_MW and 1 other field | High correlation |
| Date is highly correlated with Wind_MW and 1 other field | High correlation |
| Wind_MW is highly correlated with Date | High correlation |
| Consumption_MW is highly correlated with Production_MW | High correlation |
| Production_MW is highly correlated with Consumption_MW | High correlation |
| Consumption_MW is highly correlated with Coal_MW and 1 other field | High correlation |
| Coal_MW is highly correlated with Consumption_MW and 1 other field | High correlation |
| Date is highly correlated with Biomass_MW and 1 other field | High correlation |
| Wind_MW has 27315 (6.5%) zeros | Zeros |
| Solar_MW has 219493 (52.3%) zeros | Zeros |
| Biomass_MW has 183783 (43.8%) zeros | Zeros |
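The zero percentages in these alerts follow directly from the zero counts and the 419403 observations. A small sketch that reproduces them (the dictionary name is illustrative, not part of the report):

```python
n_observations = 419_403

# Zero counts as reported in the alerts.
zero_counts = {"Wind_MW": 27_315, "Solar_MW": 219_493, "Biomass_MW": 183_783}

# Share of zeros per variable, in percent, rounded as in the report.
zero_share = {
    name: round(100 * count / n_observations, 1)
    for name, count in zero_counts.items()
}

print(zero_share)  # {'Wind_MW': 6.5, 'Solar_MW': 52.3, 'Biomass_MW': 43.8}
```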
Reproduction
| Analysis started | 2021-09-09 22:52:21.885659 |
|---|---|
| Analysis finished | 2021-09-09 22:52:41.447349 |
| Duration | 19.56 seconds |
| Software version | pandas-profiling v3.0.0 |
Date
| Distinct | 419398 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1390064137 |
| Minimum | 1262487660 |
| Maximum | 1514947775 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 1262487660 |
|---|---|
| 5-th percentile | 1275541800 |
| Q1 | 1327113841 |
| median | 1391037353 |
| Q3 | 1453085107 |
| 95-th percentile | 1502572105 |
| Maximum | 1514947775 |
| Range | 252460115 |
| Interquartile range (IQR) | 125971266 |
Descriptive statistics
| Standard deviation | 72778786.87 |
|---|---|
| Coefficient of variation (CV) | 0.05235642366 |
| Kurtosis | -1.197368969 |
| Mean | 1390064137 |
| Median Absolute Deviation (MAD) | 62977612 |
| Skewness | -0.02476660488 |
| Sum | 5.829970692 × 10^14 |
| Variance | 5.296751818 × 10^15 |
| Monotonicity | Increasing |
Histogram with fixed size bins (bins=50)
Most frequent values
| Value | Count | Frequency (%) |
|---|---|---|
| 1268536740 | 2 | < 0.1% |
| 1268537940 | 2 | < 0.1% |
| 1268537340 | 2 | < 0.1% |
| 1268538480 | 2 | < 0.1% |
| 1268536080 | 2 | < 0.1% |
| 1440749565 | 1 | < 0.1% |
| 1416701517 | 1 | < 0.1% |
| 1464498658 | 1 | < 0.1% |
| 1326074340 | 1 | < 0.1% |
| 1413110246 | 1 | < 0.1% |
| Other values (419388) | 419388 |
Smallest values
| Value | Count | Frequency (%) |
|---|---|---|
| 1262487660 | 1 | |
| 1262488200 | 1 | |
| 1262488800 | 1 | |
| 1262489400 | 1 | |
| 1262490060 | 1 | |
| 1262490660 | 1 | |
| 1262491260 | 1 | |
| 1262492400 | 1 | |
| 1262493000 | 1 | |
| 1262493660 | 1 |
Largest values
| Value | Count | Frequency (%) |
|---|---|---|
| 1514947775 | 1 | |
| 1514947185 | 1 | |
| 1514946595 | 1 | |
| 1514946005 | 1 | |
| 1514945415 | 1 | |
| 1514944825 | 1 | |
| 1514944235 | 1 | |
| 1514943645 | 1 | |
| 1514943055 | 1 | |
| 1514942465 | 1 |
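The Date values look like Unix timestamps in seconds, although the report does not say so. Under that assumption, the minimum and maximum can be converted to calendar dates as a sanity check on the covered period:

```python
from datetime import datetime, timezone

# Assumption: Date holds seconds since the Unix epoch (not stated in the report).
first = datetime.fromtimestamp(1262487660, tz=timezone.utc)  # reported minimum
last = datetime.fromtimestamp(1514947775, tz=timezone.utc)   # reported maximum

print(first.isoformat())  # 2010-01-03T03:01:00+00:00
print(last.isoformat())   # 2018-01-03T02:49:35+00:00

# Whole days between the first and last observation.
span_days = (last - first).days
print(span_days)          # 2921
```

If the assumption holds, the dataset spans roughly eight years, which agrees with the reported range of 252460115 seconds.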
Consumption_MW
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 5515 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6608.54512 |
| Minimum | 44 |
| Maximum | 26209 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 44 |
|---|---|
| 5-th percentile | 5078 |
| Q1 | 5834 |
| median | 6578 |
| Q3 | 7279 |
| 95-th percentile | 8388 |
| Maximum | 26209 |
| Range | 26165 |
| Interquartile range (IQR) | 1445 |
Descriptive statistics
| Standard deviation | 1007.541019 |
|---|---|
| Coefficient of variation (CV) | 0.1524603374 |
| Kurtosis | -0.04993645558 |
| Mean | 6608.54512 |
| Median Absolute Deviation (MAD) | 724 |
| Skewness | 0.2620643423 |
| Sum | 2771643649 |
| Variance | 1015138.904 |
| Monotonicity | Not monotonic |
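Several of the statistics above are derived from others in the same tables. A minimal sketch, using the Consumption_MW figures as inputs, of how range, IQR, coefficient of variation and variance relate to them:

```python
# Inputs copied from the Consumption_MW tables.
minimum, q1, q3, maximum = 44, 5834, 7279, 26209
mean, std = 6608.54512, 1007.541019

value_range = maximum - minimum   # largest minus smallest value
iqr = q3 - q1                     # spread of the middle 50% of values
cv = std / mean                   # standard deviation relative to the mean
variance = std ** 2               # square of the standard deviation

print(value_range, iqr)           # 26165 1445
print(round(cv, 4))               # 0.1525
```

The same relations hold for the other variables; only the inputs change.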
Histogram with fixed size bins (bins=50)
Most frequent values
| Value | Count | Frequency (%) |
|---|---|---|
| 6588 | 347 | 0.1% |
| 6482 | 255 | 0.1% |
| 6722 | 215 | 0.1% |
| 6686 | 203 | < 0.1% |
| 6727 | 202 | < 0.1% |
| 6795 | 202 | < 0.1% |
| 6542 | 201 | < 0.1% |
| 6581 | 201 | < 0.1% |
| 6770 | 199 | < 0.1% |
| 6578 | 198 | < 0.1% |
| Other values (5505) | 417180 |
Smallest values
| Value | Count | Frequency (%) |
|---|---|---|
| 44 | 1 | |
| 47 | 1 | |
| 94 | 1 | |
| 95 | 1 | |
| 3666 | 1 | |
| 3667 | 1 | |
| 3698 | 1 | |
| 3709 | 1 | |
| 3713 | 1 | |
| 3714 | 2 |
Largest values
| Value | Count | Frequency (%) |
|---|---|---|
| 26209 | 1 | |
| 21007 | 1 | |
| 9865 | 1 | |
| 9826 | 1 | |
| 9807 | 1 | |
| 9784 | 1 | |
| 9766 | 1 | |
| 9739 | 1 | |
| 9729 | 1 | |
| 9721 | 1 |
Coal_MW
| Distinct | 3840 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2257.775364 |
| Minimum | -485 |
| Maximum | 5702 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 2 |
| Negative (%) | < 0.1% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | -485 |
|---|---|
| 5-th percentile | 1346 |
| Q1 | 1834 |
| median | 2196 |
| Q3 | 2652.5 |
| 95-th percentile | 3340 |
| Maximum | 5702 |
| Range | 6187 |
| Interquartile range (IQR) | 818.5 |
Descriptive statistics
| Standard deviation | 610.8365915 |
|---|---|
| Coefficient of variation (CV) | 0.2705479922 |
| Kurtosis | -0.08130149096 |
| Mean | 2257.775364 |
| Median Absolute Deviation (MAD) | 403 |
| Skewness | 0.309782812 |
| Sum | 946917761 |
| Variance | 373121.3415 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Most frequent values
| Value | Count | Frequency (%) |
|---|---|---|
| 2593 | 395 | 0.1% |
| 2088 | 344 | 0.1% |
| 2040 | 342 | 0.1% |
| 2073 | 340 | 0.1% |
| 1917 | 338 | 0.1% |
| 2150 | 335 | 0.1% |
| 2001 | 334 | 0.1% |
| 2134 | 334 | 0.1% |
| 2105 | 333 | 0.1% |
| 2081 | 333 | 0.1% |
| Other values (3830) | 415975 |
Smallest values
| Value | Count | Frequency (%) |
|---|---|---|
| -485 | 1 | < 0.1% |
| -43 | 1 | < 0.1% |
| 50 | 2 | |
| 357 | 1 | < 0.1% |
| 358 | 4 | |
| 359 | 4 | |
| 360 | 2 | |
| 362 | 3 | |
| 364 | 4 | |
| 365 | 2 |
Largest values
| Value | Count | Frequency (%) |
|---|---|---|
| 5702 | 1 | |
| 5338 | 1 | |
| 4408 | 1 | |
| 4395 | 1 | |
| 4383 | 1 | |
| 4370 | 1 | |
| 4369 | 1 | |
| 4368 | 1 | |
| 4365 | 1 | |
| 4353 | 1 |
Gas_MW
| Distinct | 2300 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1008.151282 |
| Minimum | -414 |
| Maximum | 2666 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | -414 |
|---|---|
| 5-th percentile | 366 |
| Q1 | 604 |
| median | 984 |
| Q3 | 1329 |
| 95-th percentile | 1871 |
| Maximum | 2666 |
| Range | 3080 |
| Interquartile range (IQR) | 725 |
Descriptive statistics
| Standard deviation | 469.6586672 |
|---|---|
| Coefficient of variation (CV) | 0.4658613005 |
| Kurtosis | -0.5246681608 |
| Mean | 1008.151282 |
| Median Absolute Deviation (MAD) | 365 |
| Skewness | 0.4147662848 |
| Sum | 422821672 |
| Variance | 220579.2637 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Most frequent values
| Value | Count | Frequency (%) |
|---|---|---|
| 416 | 778 | 0.2% |
| 417 | 702 | 0.2% |
| 415 | 638 | 0.2% |
| 400 | 592 | 0.1% |
| 418 | 576 | 0.1% |
| 398 | 569 | 0.1% |
| 397 | 567 | 0.1% |
| 424 | 564 | 0.1% |
| 399 | 556 | 0.1% |
| 423 | 553 | 0.1% |
| Other values (2290) | 413308 |
Smallest values
| Value | Count | Frequency (%) |
|---|---|---|
| -414 | 1 | < 0.1% |
| 0 | 2 | < 0.1% |
| 29 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 116 | 1 | < 0.1% |
| 120 | 9 | |
| 121 | 2 | < 0.1% |
| 128 | 1 | < 0.1% |
| 129 | 8 | |
| 130 | 13 |
Largest values
| Value | Count | Frequency (%) |
|---|---|---|
| 2666 | 1 | |
| 2660 | 1 | |
| 2659 | 1 | |
| 2651 | 1 | |
| 2506 | 1 | |
| 2504 | 2 | |
| 2501 | 1 | |
| 2499 | 1 | |
| 2463 | 1 | |
| 2458 | 2 |
Hidroelectric_MW
Real number (ℝ≥0)
| Distinct | 4221 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1840.614073 |
| Minimum | 0 |
| Maximum | 4728 |
| Zeros | 21 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 669 |
| Q1 | 1262 |
| median | 1809 |
| Q3 | 2395 |
| 95-th percentile | 3100 |
| Maximum | 4728 |
| Range | 4728 |
| Interquartile range (IQR) | 1133 |
Descriptive statistics
| Standard deviation | 754.0300395 |
|---|---|
| Coefficient of variation (CV) | 0.4096622158 |
| Kurtosis | -0.564910179 |
| Mean | 1840.614073 |
| Median Absolute Deviation (MAD) | 566 |
| Skewness | 0.1825489605 |
| Sum | 771959064 |
| Variance | 568561.3005 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Most frequent values
| Value | Count | Frequency (%) |
|---|---|---|
| 2476 | 359 | 0.1% |
| 1550 | 239 | 0.1% |
| 1573 | 238 | 0.1% |
| 2012 | 232 | 0.1% |
| 1621 | 229 | 0.1% |
| 1913 | 229 | 0.1% |
| 1435 | 228 | 0.1% |
| 1505 | 228 | 0.1% |
| 1649 | 227 | 0.1% |
| 1384 | 226 | 0.1% |
| Other values (4211) | 416968 |
Smallest values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 21 | < 0.1% |
| 53 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 58 | 1 | < 0.1% |
| 59 | 2 | < 0.1% |
| 60 | 3 | < 0.1% |
| 85 | 1 | < 0.1% |
| 87 | 2 | < 0.1% |
| 90 | 1 | < 0.1% |
| 91 | 1 | < 0.1% |
Largest values
| Value | Count | Frequency (%) |
|---|---|---|
| 4728 | 1 | |
| 4706 | 1 | |
| 4700 | 1 | |
| 4692 | 1 | |
| 4687 | 1 | |
| 4680 | 1 | |
| 4668 | 1 | |
| 4664 | 1 | |
| 4663 | 1 | |
| 4656 | 1 |
Nuclear_MW
Real number (ℝ≥0)
| Distinct | 879 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1320.407794 |
| Minimum | 0 |
| Maximum | 1450 |
| Zeros | 129 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 702 |
| Q1 | 1377 |
| median | 1403 |
| Q3 | 1419 |
| 95-th percentile | 1427 |
| Maximum | 1450 |
| Range | 1450 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 223.1816069 |
|---|---|
| Coefficient of variation (CV) | 0.1690247573 |
| Kurtosis | 4.082766423 |
| Mean | 1320.407794 |
| Median Absolute Deviation (MAD) | 19 |
| Skewness | -2.410152883 |
| Sum | 553782990 |
| Variance | 49810.02967 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Most frequent values
| Value | Count | Frequency (%) |
|---|---|---|
| 1421 | 11618 | 2.8% |
| 1422 | 11318 | 2.7% |
| 1423 | 11129 | 2.7% |
| 1420 | 11038 | 2.6% |
| 1419 | 10593 | 2.5% |
| 1424 | 10450 | 2.5% |
| 1418 | 9729 | 2.3% |
| 1425 | 9619 | 2.3% |
| 1417 | 8851 | 2.1% |
| 1426 | 8417 | 2.0% |
| Other values (869) | 316641 |
Smallest values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 129 | < 0.1% |
| 37 | 1 | < 0.1% |
| 39 | 4 | < 0.1% |
| 40 | 3 | < 0.1% |
| 44 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
| 47 | 1 | < 0.1% |
| 49 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 83 | 1 | < 0.1% |
Largest values
| Value | Count | Frequency (%) |
|---|---|---|
| 1450 | 1 | < 0.1% |
| 1443 | 1 | < 0.1% |
| 1441 | 3 | < 0.1% |
| 1440 | 11 | < 0.1% |
| 1439 | 13 | < 0.1% |
| 1438 | 29 | < 0.1% |
| 1437 | 52 | < 0.1% |
| 1436 | 113 | < 0.1% |
| 1435 | 252 | |
| 1434 | 374 |
Wind_MW
| Distinct | 2828 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 501.8796504 |
| Minimum | -521 |
| Maximum | 7944 |
| Zeros | 27315 |
| Zeros (%) | 6.5% |
| Negative | 11515 |
| Negative (%) | 2.7% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | -521 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 64 |
| median | 272 |
| Q3 | 733 |
| 95-th percentile | 1804 |
| Maximum | 7944 |
| Range | 8465 |
| Interquartile range (IQR) | 669 |
Descriptive statistics
| Standard deviation | 585.6012027 |
|---|---|
| Coefficient of variation (CV) | 1.166815993 |
| Kurtosis | 1.764636182 |
| Mean | 501.8796504 |
| Median Absolute Deviation (MAD) | 247 |
| Skewness | 1.517715362 |
| Sum | 210489831 |
| Variance | 342928.7686 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Most frequent values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 27315 | 6.5% |
| -1 | 3060 | 0.7% |
| -2 | 2222 | 0.5% |
| 3 | 1655 | 0.4% |
| 1 | 1559 | 0.4% |
| 2 | 1531 | 0.4% |
| 5 | 1390 | 0.3% |
| 4 | 1383 | 0.3% |
| -3 | 1379 | 0.3% |
| 6 | 1332 | 0.3% |
| Other values (2818) | 376577 |
Smallest values
| Value | Count | Frequency (%) |
|---|---|---|
| -521 | 2 | < 0.1% |
| -26 | 3 | < 0.1% |
| -25 | 18 | < 0.1% |
| -24 | 32 | < 0.1% |
| -23 | 34 | |
| -22 | 57 | |
| -21 | 41 | |
| -20 | 54 | |
| -19 | 52 | |
| -18 | 84 |
Largest values
| Value | Count | Frequency (%) |
|---|---|---|
| 7944 | 1 | |
| 2806 | 1 | |
| 2803 | 1 | |
| 2802 | 1 | |
| 2800 | 2 | |
| 2799 | 1 | |
| 2798 | 2 | |
| 2797 | 1 | |
| 2796 | 1 | |
| 2795 | 1 |
Solar_MW
| Distinct | 857 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70.44516849 |
| Minimum | -6 |
| Maximum | 859 |
| Zeros | 219493 |
| Zeros (%) | 52.3% |
| Negative | 78275 |
| Negative (%) | 18.7% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | -6 |
|---|---|
| 5-th percentile | -1 |
| Q1 | 0 |
| median | 0 |
| Q3 | 16 |
| 95-th percentile | 492 |
| Maximum | 859 |
| Range | 865 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 161.3681006 |
|---|---|
| Coefficient of variation (CV) | 2.290690817 |
| Kurtosis | 5.507535705 |
| Mean | 70.44516849 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.512139247 |
| Sum | 29544915 |
| Variance | 26039.66388 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Most frequent values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 219493 | 52.3% |
| -1 | 78214 | 18.6% |
| 1 | 3468 | 0.8% |
| 2 | 1704 | 0.4% |
| 3 | 1322 | 0.3% |
| 4 | 1189 | 0.3% |
| 5 | 1011 | 0.2% |
| 6 | 915 | 0.2% |
| 7 | 912 | 0.2% |
| 8 | 854 | 0.2% |
| Other values (847) | 110321 |
Smallest values
| Value | Count | Frequency (%) |
|---|---|---|
| -6 | 1 | < 0.1% |
| -4 | 8 | < 0.1% |
| -3 | 41 | < 0.1% |
| -2 | 11 | < 0.1% |
| -1 | 78214 | 18.6% |
| 0 | 219493 | 52.3% |
| 1 | 3468 | 0.8% |
| 2 | 1704 | 0.4% |
| 3 | 1322 | 0.3% |
| 4 | 1189 | 0.3% |
Largest values
| Value | Count | Frequency (%) |
|---|---|---|
| 859 | 1 | |
| 858 | 1 | |
| 854 | 1 | |
| 852 | 2 | |
| 851 | 1 | |
| 848 | 1 | |
| 847 | 1 | |
| 845 | 1 | |
| 844 | 1 | |
| 843 | 1 |
Biomass_MW
Real number (ℝ≥0)
HIGH CORRELATION, ZEROS
| Distinct | 101 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.36559347 |
| Minimum | 0 |
| Maximum | 110 |
| Zeros | 183783 |
| Zeros (%) | 43.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 33 |
| Q3 | 55 |
| 95-th percentile | 67 |
| Maximum | 110 |
| Range | 110 |
| Interquartile range (IQR) | 55 |
Descriptive statistics
| Standard deviation | 27.42416309 |
|---|---|
| Coefficient of variation (CV) | 0.9338875821 |
| Kurtosis | -1.713833507 |
| Mean | 29.36559347 |
| Median Absolute Deviation (MAD) | 33 |
| Skewness | 0.03615239414 |
| Sum | 12316018 |
| Variance | 752.0847211 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Most frequent values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 183783 | 43.8% |
| 33 | 14475 | 3.5% |
| 57 | 9318 | 2.2% |
| 52 | 9217 | 2.2% |
| 55 | 9201 | 2.2% |
| 58 | 9046 | 2.2% |
| 61 | 8868 | 2.1% |
| 64 | 7630 | 1.8% |
| 56 | 7551 | 1.8% |
| 53 | 7520 | 1.8% |
| Other values (91) | 152794 |
Smallest values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 183783 | 43.8% |
| 10 | 3 | < 0.1% |
| 11 | 33 | < 0.1% |
| 12 | 72 | < 0.1% |
| 13 | 14 | < 0.1% |
| 14 | 26 | < 0.1% |
| 15 | 13 | < 0.1% |
| 16 | 104 | < 0.1% |
| 17 | 230 | 0.1% |
| 18 | 68 | < 0.1% |
Largest values
| Value | Count | Frequency (%) |
|---|---|---|
| 110 | 4 | < 0.1% |
| 109 | 14 | |
| 108 | 11 | |
| 107 | 11 | |
| 106 | 3 | < 0.1% |
| 105 | 5 | < 0.1% |
| 104 | 3 | < 0.1% |
| 103 | 3 | < 0.1% |
| 102 | 3 | < 0.1% |
| 100 | 1 | < 0.1% |
Production_MW
| Distinct | 6936 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7028.770679 |
| Minimum | 0 |
| Maximum | 11295 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5208 |
| Q1 | 6177 |
| median | 6973 |
| Q3 | 7820 |
| 95-th percentile | 9058 |
| Maximum | 11295 |
| Range | 11295 |
| Interquartile range (IQR) | 1643 |
Descriptive statistics
| Standard deviation | 1169.322621 |
|---|---|
| Coefficient of variation (CV) | 0.1663623235 |
| Kurtosis | -0.3185947843 |
| Mean | 7028.770679 |
| Median Absolute Deviation (MAD) | 821 |
| Skewness | 0.2301559273 |
| Sum | 2947887509 |
| Variance | 1367315.393 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Most frequent values
| Value | Count | Frequency (%) |
|---|---|---|
| 7001 | 322 | 0.1% |
| 7066 | 252 | 0.1% |
| 6697 | 177 | < 0.1% |
| 6840 | 176 | < 0.1% |
| 6696 | 174 | < 0.1% |
| 6654 | 174 | < 0.1% |
| 6673 | 169 | < 0.1% |
| 6732 | 169 | < 0.1% |
| 6734 | 168 | < 0.1% |
| 6566 | 167 | < 0.1% |
| Other values (6926) | 417455 |
Smallest values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1 | < 0.1% |
| 44 | 1 | |
| 47 | 1 | |
| 50 | 1 | |
| 744 | 1 | |
| 936 | 1 | |
| 3616 | 1 | |
| 3621 | 1 | |
| 3671 | 1 | |
| 3675 | 1 |
Largest values
| Value | Count | Frequency (%) |
|---|---|---|
| 11295 | 1 | |
| 11227 | 1 | |
| 11219 | 1 | |
| 11205 | 1 | |
| 11183 | 1 | |
| 11153 | 1 | |
| 11150 | 1 | |
| 11146 | 1 | |
| 11138 | 1 | |
| 11131 | 1 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
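The calculation described above can be sketched in plain Python. The function name and the toy data are illustrative assumptions and are not taken from the profiled dataset; the report itself does not show how it computes r.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

# Hypothetical toy data with a nearly linear relationship.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(x, y)
# Shifting x and rescaling it by a positive factor leaves r unchanged.
r_rescaled = pearson_r([10 * a + 3 for a in x], y)
```

Here `r` is close to, but below, 1, and `r_rescaled` agrees with it up to floating-point error, which illustrates the invariance under location and scale.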
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
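As a sketch of that definition, ρ can be computed by ranking both variables and applying the Pearson formula to the ranks. The helper names and the toy data are assumptions for illustration; ties receive the average of the ranks they span, which is a common convention but not one the report specifies.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    sx = sqrt(sum((a - mx) ** 2 for a in x) / n)
    sy = sqrt(sum((b - my) ** 2 for b in y) / n)
    return cov / (sx * sy)

def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Pearson's r applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Hypothetical toy data: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5]
y = [a ** 3 for a in x]

rho = spearman_rho(x, y)
r = pearson_r(x, y)
```

For this monotonic but nonlinear relationship, `rho` equals 1 up to floating-point error while `r` stays below 1.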
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
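The pair-counting definition above corresponds to the variant usually called tau-a. A minimal sketch follows; the function name and toy data are illustrative assumptions, and statistical libraries often default to a tie-adjusted variant (tau-b), so their results can differ when ties are present.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables order the pair the same way.
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
        # Tied pairs (sign == 0) count toward the total only.
    return (concordant - discordant) / len(pairs)

# Hypothetical toy data: one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]

tau = kendall_tau_a(x, y)
print(round(tau, 4))  # 0.6667
```

With five concordant pairs and one discordant pair out of six, τ is (5 - 1) / 6.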
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the phik project.
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
First rows
| | Date | Consumption_MW | Coal_MW | Gas_MW | Hidroelectric_MW | Nuclear_MW | Wind_MW | Solar_MW | Biomass_MW | Production_MW |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1262487660 | 5302.0 | 1754.0 | 1144.0 | 1391.0 | 706.0 | 0.0 | 0.0 | 0.0 | 4995.0 |
| 1 | 1262488200 | 5318.0 | 1777.0 | 1145.0 | 1468.0 | 708.0 | 0.0 | 0.0 | 0.0 | 5097.0 |
| 2 | 1262488800 | 5268.0 | 1743.0 | 1139.0 | 1361.0 | 708.0 | 0.0 | 0.0 | 0.0 | 4951.0 |
| 3 | 1262489400 | 5358.0 | 1759.0 | 1142.0 | 1449.0 | 707.0 | 0.0 | 0.0 | 0.0 | 5057.0 |
| 4 | 1262490060 | 5327.0 | 1764.0 | 1142.0 | 1417.0 | 709.0 | 0.0 | 0.0 | 0.0 | 5031.0 |
| 5 | 1262490660 | 5307.0 | 1771.0 | 1142.0 | 1418.0 | 706.0 | 0.0 | 0.0 | 0.0 | 5037.0 |
| 6 | 1262491260 | 5256.0 | 1752.0 | 1153.0 | 1368.0 | 712.0 | 0.0 | 0.0 | 0.0 | 4985.0 |
| 7 | 1262492400 | 5308.0 | 1762.0 | 1151.0 | 1461.0 | 709.0 | 0.0 | 0.0 | 0.0 | 5083.0 |
| 8 | 1262493000 | 5426.0 | 1785.0 | 1153.0 | 1515.0 | 704.0 | 0.0 | 0.0 | 0.0 | 5157.0 |
| 9 | 1262493660 | 5340.0 | 1782.0 | 1150.0 | 1488.0 | 706.0 | 0.0 | 0.0 | 0.0 | 5126.0 |
Last rows
| | Date | Consumption_MW | Coal_MW | Gas_MW | Hidroelectric_MW | Nuclear_MW | Wind_MW | Solar_MW | Biomass_MW | Production_MW |
|---|---|---|---|---|---|---|---|---|---|---|
| 419393 | 1514942465 | 7307.0 | 2328.0 | 955.0 | 1311.0 | 1404.0 | 2269.0 | -1.0 | 43.0 | 8308.0 |
| 419394 | 1514943055 | 7295.0 | 2250.0 | 941.0 | 1335.0 | 1404.0 | 2244.0 | -1.0 | 44.0 | 8217.0 |
| 419395 | 1514943645 | 7272.0 | 2253.0 | 946.0 | 1395.0 | 1404.0 | 2205.0 | -1.0 | 45.0 | 8246.0 |
| 419396 | 1514944235 | 7266.0 | 2239.0 | 946.0 | 1387.0 | 1406.0 | 2204.0 | -1.0 | 44.0 | 8224.0 |
| 419397 | 1514944825 | 7287.0 | 2275.0 | 945.0 | 1425.0 | 1401.0 | 2193.0 | -1.0 | 45.0 | 8283.0 |
| 419398 | 1514945415 | 7262.0 | 2279.0 | 942.0 | 1444.0 | 1403.0 | 2175.0 | -1.0 | 45.0 | 8287.0 |
| 419399 | 1514946005 | 7167.0 | 2259.0 | 943.0 | 1383.0 | 1405.0 | 2174.0 | -1.0 | 43.0 | 8207.0 |
| 419400 | 1514946595 | 7122.0 | 2251.0 | 945.0 | 1362.0 | 1405.0 | 2159.0 | -1.0 | 45.0 | 8165.0 |
| 419401 | 1514947185 | 7264.0 | 2288.0 | 944.0 | 1454.0 | 1406.0 | 2132.0 | -1.0 | 45.0 | 8268.0 |
| 419402 | 1514947775 | 7115.0 | 2255.0 | 944.0 | 1370.0 | 1404.0 | 2131.0 | -1.0 | 41.0 | 8145.0 |